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During 1970 Katser announced and later clarified (Kaiser and Rice, 1974) 


the development of an index which provides in format on regarding the psycho- 


metric adequacy of a data set under consideration for factor analytic methods, 


His Measure of Sampling Adequacy (MSA) indexes the degree to which the vari- 


ables in one's investigation comprise an adequate sample from the domin of 


interest. The overall MSA is defined as: 


xxrꝰ jx 
us IAK 
rrrzzu + Lr: jk 
11 j#k 


where the r2 jk is a squared off diagonal element of the original correlation 


matrix and 4730 is a squared off diagonal of the anti-image correlation matrix, 


sR 18. me matrix 82 is defined as [tag R 


sige According to present cali- 
brations, 


MSA lies between zero and one with values in the .80's and . 90 


Signaling data sets which are really appropriate for factor analytic methods, 


If one shoul, encounter an overall MSA close to 50, it should be clear that 


the original and anti-image correlations are close to equal (rrr? zu = rrqꝰ2 4k). 
#k 


J J 
This means that the intercorrelations of the sample anti-image parts of the 


data are much too large for factoring methods to be considered. (They are 


uncorrelated in the Population). 


MSA appears to improve Molding the others constant) as: 


(1) The number of variables (NV) increases 

(2) The number of subjects (SS) increases 

(3) The general level of correlation (LC) increases, and 
(4) The effective number of factors decreases 


Several studies have been conducted with 


demonstrated (19744, 1974B) that MSA wou ld 


and of MSA. Dziuban and Shirkey 


signal random variables ina 


mixed set (Shaycofe, 1970) and that it would "flag" a correlation matrix 
based upon random numbers and a sample size of fifty (Armstrong and Soelberg, 
1968). They (Shirkey and Dziuban, 1976) conde ted a study of the index with 
sample co:relation matrices from populations in which the variables were 
uncorrelated. Systematic variation of sample size (SS) and number of vari- 
ables (NV) failed to produce sample-based MSA's which would signal one to 
proceed with factoring mechods. They concluded that there exists little 
danger that the MSA would erroneously lead one to factor a sample correlation 
matrix drawn from a population in which the variables were uncorrelated. 
Cerny and Kaiser (In Press) con:z!ved matrices which approximated actual 
sample correlation matrices. They systematically varied the number of vari- 
ables (p), the number of factors (q), and the root mean square off diagonal 
correlation (f). They found :hat the MSA was most sensitive to the number of 
variables, then the number of factors, and finally the level of correlation. 
Cerny and Kaiser reported that they with (In Press) Meyer investigated the 
properties oi MSA in the Spearman case as a function of the number of 
variables and magnitude of correlation. They found that MSA neared .5 when 
the common off diagonal correlation approached zero and the number of vari- 


ables approached two. 


Procedure and Results 5 


Sample correlation matrices were created using the Browre (1968) 
algorythm. The procedure allows one to generate cvrrelation matrices of 
specified sample size (38) and number of variables (NV) from a multivariate 


normal population of a specified correlational level. Ten matrices 


— hd 


„ ‘ 3 
(3 x 3 x 5) were generated for each one of the one hundred and twenty-five 
SS-NV-LC combinations--a total of 1,250. 
“BS = 50 100 250 500 1000 
NV= 5 10 15 20 25 
Lo- 1 3 3 7 9 
The MSA for each cell of the design was determined. A portion of those means 
is presented in Table One. 
The results of an analysis of variance (Table One) revealed significant 
(P < .I) main effects for number of variables (NV) and level of correlation 
(LC) as well as their interaction. The strength of association (w?) was 
determined for each significant F ratio. Approximately sixty-eight percent 
of the MSA variance (o? = .677) was accounted for by the tae of variables 
(NV) and eighteen percent Go? = .178) attributable to level of correlation 
(LC). The interaction NV x LC accounted for eight percent of the total 
variance (w* = .086). 
Insert Table One About Here 
The overall means for the three factors crossed two at a time are pre- 
sented in Table Two. It msy be observed that there was a substantial increase 
in the MSA for all correlation levels (LC) as the number of variables (NV) 
increased from five to twenty-five. The nature of that trend may be observed 
in Graph One. The greatest overali increase was obtained for LC = .1; MSA 
went from .626 to .956. The increase was not as great for the remaining levels 
of correlation but the trend continued. At all levels of correlation for 


NV = 25, the MSA's were substantial (in the high .90's). 


5 


Graph Two portrays the change in MSA across level of correlation (LC) 
for the number of variables (NV). In each case there was an initial increase 
when LC changed from .1 to .3. For NV = 5 the maximum MSA (.830) was realized 
at LC = .7. In each of the other cases the highest value wax achieved at a 
level of correlation equal to .5 with slight decreases thereafter. Those 
decreases, however, would not change the interpretation of MSA levels under 

ö the present calibration. 

From Tables One and Two it may be noted that sample size had a minimal 
effect on the index. The overall increage in MSA due to sample size (SS), 
Table Two, was from .909 to .911. The increase for number of variables was 
from .777 to .973. Level of correlation resulted in an increase from . 839. 


to .911. 


Discussion ö l 


We have found that other things being equal the Kaiser-Rice Measure of 


f 


Sampling Adequacy tends to be most sensitive to the number of variables (p). 
Since it is well knowm that many characteristics of factor analytic solutions 
are related to p,“ our results are not surprising. Increasing the number of 
variables (NV) from five to twenty-five yields an MSA in the aid to high 

-90's. With NV = 25, LC = .1 and SS = 50, our MSA was .966. This was quite 

5 substantial by present calibration stau<:rds. 
The MSA is also affected by the general level of correlation of the matrix. 

The nature of that relationship indicates that increasing LC at the lower 
levels (.1 to .3) results in a substantial increase in the index. When one 

has variables correlated approximately at the .5 level, however, the maximum 


possible MSA will have been achieved. 
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Ie is somewhat surprising that the sample size (SS) did not have a 
stronger effect. It may be that MSA only gets depressed when the sample 
size (SS) gets down close to the number of variables. This should be 
verified. In the present study even the smallest sample size (SS = 50) 
was quite a bit larger than the largest number of variables (NV 25). 

It is rare, however, to find practicing factor analysts who work with 
sale sizes close to fifty, although recently Pruzek (1975) has indicated 
that small sample factor analysis is indeed a possibility. 

It appears from these data that one must expend considerable effort to 
achieve an MSA as low as . 30. Culy In the case where the variables are 
literally unrelated is the MSA likely to approach .50. The lowest cell . 
mean we observed was for LC = ol, NV = 5, SS = 55, MSA = .577. To encounter 
an MSA in the .50's should be clear warning not to proceed with factor 


analytic methods. Quite possibly matrices which yield values in the .60's 


not so hot“ - should be given serious consideration. 8 


Given the three main ef facts we have studied, the MSA has performed 
very predictably with a small surprise regarding the effect of sample size. 
These results suggest that, if one uses the MSA as the basis of a decision 
rule for the psychometric quality of his data set, factor enalytic practice 


will be improved. 
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TABLE ONE 


Results of the Analysis of Variance 


Subjects 


Number of Variables (NV) & 6.3806 1.5951 4077.1°43 


Level of Correlation (c) 4 1.6632 4158 1062. 7622 


Sample Size (88) 4 „0005 0001 23595 


NV x LC 16 8108 0506 129.5360 


NV x SS 16 0088 0005 1.4148 


LC x SS 16 20024 0001 3878 


NV x LC x SS 64 0287 000 1.161 


839 927 0937 2932 0919 


909 911 911 2910 910 
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